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Abstract. Consider a randomly oriented graph G = (V, E) and let a, s and h be 
three distinct vertices in V . We study the correlation between the events {a s} and 
{s — > 6}. We show that, when G is the complete graph Kn, the correlation is negative 
for n = 3, zero for n — 4, and that, counter-intuitively, it is positive for n > 5. We 
also show that the correlation is always negative when G is a cycle, Gn, and negative 
or zero when G is a tree (or a forest). 



1. Introduction 

Given a graph G = {V, E) we orient each edge with equal probabihty for the two 
possible directions and independent of all other edges. This model has been studied 
previously in for instance [U [HI [9]. Let a, s and b be three distinct vertices in V . The 
object of this paper is to study the correlation of the two events {a — > s}, that there 
exists a directed path from a to s, and {s — > h}. One might intuitively guess that they 
are always negatively correlated, i.e. that P{a — > s, s ^ 6) < P{a — > s) ■ P{s h). This 
is however not true for all graphs. In fact, the smallest counterexample is the graph on 
four vertices with all edges except {a, 6} present, see Section [2j 

In section [3] we prove that for the complete graph, Kn, the events are negatively 
correlated for n = 3, independent for n = 4 and positively correlated for n > 5. The 
complementary events, A := {a s}, that there does not exist a directed path from 
a to s, and := {s ^ 6} have the same covariance, and we show that their relative 
covariance, {P{A D B) - P{A) ■ P{B))/P{A n B) converges to 1/3 as n ^ oo. 

In Section H] we give exact recursions for the probabilities P{A) and P{A n B), and 
compute these for n < 15. For completeness, in Section [5] we show that the events are 
negatively correlated when G is a cycle and negatively correlated or independent when G 
is a tree (or a forest). We end with stating a number of conjectures and open problems. 

In a coming paper, [T], we will study this problem when G is the random graph G{n,p). 

The question studied here was posed in [H]. There it was proved that under this 
model for any vertices a,b, s,t (z V the events {s a} and {s b} are never negatively 
correlated. This was shown to be true also if we first conditioned on {s t}, i.e. 
P{s a, s ^ b\s ^ t) > P{s a\s ^ t) ■ P{s b\s t). As a sort of converse it was 
also proved that P{s — > a, 6 — > t|s ^ t) < P{s a\s t) ■ P{b — > t\s t). The proofs 
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in [8] relied heavily on the results in [2] and [3], where similar statements were proved 
for edge percolation on a given graph and a result from [9] that relates the random 
orientation with edge percolation. This cluster of questions on correlation of paths 
have been inspired by an interesting conjecture due to Kasteleyn, named the Bunkbed 
conjecture by Haggstrom [6], see also [7] and Remark 5 in [2]. 

Acknowledgment: We thank Svante Janson, Stanislav Volkov and Johan Wastlund 
for fruitful discussions. 

2. A COUNTER-INTUITIVE EXAMPLE 

Let G be a graph with four vertices and all edges except the one between a and b 
present, see Figure [H and let C := {a ^ s} and := {s — > b}. 




Figure 1. A counterexample with positive correlation 



Then, PiC) = P{D) = ^ + | + ^ = § and P(C n i?) = i + ^ + ^ + ^ - ^, 



1 I 1 I 1 I 1 — 7 



that 



P{C n D) - P{C) ■ P{D) 



7 



16 V32 



21 



> 0. 



so 



1024 



Note that if we relabel the vertices in this graph as in Figure O we still get P{C) 
P{D) = §, but P{C n D) = i + i + ^ = ii, so that 

P{Cf^D)-P{C)■P(D) 



4 ' 8 ' 32 ~ 32' 

13 /21^^ 



32 V 32 



25 

1024 



< 0. 



In fact, the labeling in Figure [His the only graph with four vertices that gives a positive 
correlation. 




Figure 2. A slightly modified example with negative correlation 
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3. The complete graph Kn 

Let G be the complete graph, Kn , and orient the edges either way independently with 
probability |. 

For three different vertices, a, s and 6, of Kn we want to know if the event {a s} 
and the event {s — > 6} are positively or negatively correlated. It turns out to be easier 
to study the correlation of the complementary events, i.e. A := {a ^ s}, that there does 
not exist a directed path from a to s, and the event B := {s 6}, which have the same 
correlation. 

Think of the vertices of Kn as [n] := {1, . . . , n}. 

To estimate P{A), the following lemma will be useful. 
Lemma 3.1. For all n > 0, 

«(")^-E(:)E("r)G)""^-©" 

k=l ^ ' m=l ^ ' 

Proof. As a(0) = a(l) = 0, we will assume that n > 2. We will use that {^)^"^ = 
(i)"" • < (I)"* • i^)^-'^ = 4 • (i)'^ • (i)'", if m > 1 and A: > 2, and split the sum 

into two parts, k = 1 and k >2. 

m=l ^ ^ k=2 ^ ^ m=l ^ ^ 

."■©"-^-E(:)©'E(",; 

fc=2 ^ ^ m=l ^ ^ 

^"■(r'--E(:)a)'©"-' 

The lemma follows by showing that n ■ (|)" ^ < 1.6 • (l)'^ holds for all n > 2. □ 

Remark 3.2. As k = 1 contributes n ■ (l)"^"'^, this gives a lower bound for a{n). This 
is in fact the dominating term, so that a{n) asymptotically is of order p{n) ■ (|)", where 
p{n) is a polynomial in n. By splitting the sum into three parts, we can, for example, 
show that a{n) < 13.6 • (^)" for all n > 0. 

Theorem 3.3. For all n>2. 

Proof. A necessary condition for A is that the edge between a and s is directed from s 
to a. Let E, with P{E) = 1/2, denote this event. 

Let Oa and Og denote the sets of points in [n] \ {a, s} that can be reached from a and 
s respectively in one step. Similarly, let and Is denote the sets of points in [n] \ {a, s} 
that can reach a and s respectively in one step. 
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For the lower bound, note that E n {Oa = ^) ^ A and E n {Ig = 9) ^ A, so that 

//In n-2 /In n-2 /In 2n-4\ 

P(A)>P((O, = 0)U(/.=0))/2= (^(-) +(-) -(-) J/2 

For the upper bound, note that A ^ E f] {Oa C Os) H F, where F is the event that the 
points in Oa have no directed edges to points in Is- Note that, if A; = \0a\ and m = |0s[, 
then /c < 77T, is necessary, and there are k{n — 2 — m) edges in F. 

P{A)<P{{OaCOs)nF)/2 



1 ^/n-2\ /l\"-2l^ /n-2-A:\ /Ix'^-s /l-^fc(n-2-m) 
2'^l ^ iU; ^ ( m-k )\2) ■ V2. 



A;=0 m=k 



iyn-3'^/n-2\ (n-2-k\ fl^k-m 

^ fc=0 ^ ^ ^ m=0 



m / \2 



l\2n-3 /n-2\ /l\2n-3:i^ /n-2 

2) ^( ,n j+y , 

m=0 ^ ^ fc=l ^ 



2/ k ^ \ m J\2 

k=l m=l 



1 \ 2n-3 „ / 1 \ 2n-3 

_ 1 on— 2 I /I 

2. 



j . 2"-^ + (^^j . (2^^-^ - 1) 



/l\2n-3!L4 /n-2\ {n-2-k\ /lxfc-"i 

+ (2) E( , j E ( n, 

A:=l ^ ^ m=l ^ 

/\\n~2 /l\2n-3 / 1 \ n-2 / 1 \ 2n-3 / 7 \ n-2 

<(5) +y ■<.(«- 2) <(-) +5^6. (-) (j) 

where the function a was defined in Lemma l3.ll □ 
Theorem 3.4. 



hm 2"^-^ ■ P{A) = 1. 

n— >oo 

Proof. Follows immediately from Theorem 13.31 □ 
To estimate P{Ar\B)^ the following lemma, in combination with Lemma [3 -H is useful. 
Lemma 3.5. For all n > 0, 

fc=l ^ ^ 1=1 ^ ^ m=l ^ ^ 
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Proof. Note first that 6(0) = 6(1) = 6(2) = 0, so that we may assume that n > 3. Note 
also that for k,i>l,ki = {k - - I) + k + i - I > /c + i-l, so that (i)^* < 2(i)*^(i)*. 
This gives 

«")^-e(:)-(^)'"e(":')-(^)' (r'"'E(:) 

k=l ^ ^ i=l ^ ' m=l ^ ' 

fc=l ^ ^ fc=l ^ ^ 

□ 

Theorem 3.6. For all n > 3, 

(2) (^-^2) )£^'(-4nB)<(5) (3 + 20.8. (-) ), 

Proof. A necessary condition for ACiB is that the edge between a and s is directed from 
s to a, that the edge between s and 6 is directed from 6 to s and that the edge between 
a and 6 is directed from b to a. Let E, with P{E) = 1/8, denote this event. 

Let Oq, Os and O;, denote the sets of points in [n] \ {a, s,6} that can be reached 
from a, s and 6 respectively in one step. Similarly, let Iq, Ig and If, denote the sets of 
points in [n] \ {a, s, 6} that can reach a, s and 6 respectively in one step. For the lower 
bound, note that E ^ {Oa = Os = ^) ^ A r\ B , E ^ {Oa = h = ^) ^ A ^^ B said 
E n {Is = Ib = ^) ^ An B, so that 

P{A nB)> P{{Oa = = 0) U {Oa = A = 0) U {Is = h = 0))/8 

.Q-(3_2(i)-). 

as {Oa = = 0) n (/, = 4 = 0) = 0. 

For the upper bound, we note that Ar\B Er\{Oa <Z Og C Oh)riF, where F denotes 
the event that the points in Oa have no directed edges to points in Is and the points in 
Os have no directed edges to points in 4, so that P{AriB) < P{{Oa C Os C Ob)r)F)/8. 
Let k = \Os\, i = \Ib\ and m = \0a\. Then 0<A;<n — 3, 0<i<n — 3 — /c and 
< m < k and the direction of all 3(n — 3) + 3 = 3n — 6 edges connected to a, s and 6 
are determined. Further, the event F determines the direction of ki + m{n — 3 — k — i) 
edges, so that 

n . (1)-- g (" I ^) "e (" - r ^) (i)'- E C) (i)"'"-"-' 

A;=0 ^ ^ •i=0 ^ ^ m=0 ^ ^ 

/ 1 \ 371—6 

= (2) -(51 + 52 + 53 + 54 + 55 + 56 + 57), 

where the triple sum is split into seven parts: 

1: A; = 0+>m = 0, 2: A; = n- 3+>z = 0, 3: i = m = 0,1 < k < n - 4, 
4: i = 0, m > 1, 1 < A; < n - 4, 5: m = 0,i > 1,1 < k < n - A, 
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6: i = n — 3 — kjiTi > 1,1 < k < n — 4, 

7: l<k<n-4,l<i<n-4:-k,l<m<k. 

The first three cases correspond to the three cases of the lower bound, 



i=0 
n—3 



m=0 
n— 4 



_ nn-3 



The next three can be expressed by the function a of Lemma I3.H 

^^=E( , jEy -u) =E( , J E ( ^ 'j-y 

k=l ^ ' m=l ^ ' j=l \ / m=l ^ ' 

= a(n- 3) < 5.6- (-] , 

fc=l ^ ^ j=0 ^ ^ 

^ ^/n-3\ /l\A^-(n-3-fc) ^/n-3\A/A;\ /l^m(n-3-fc) 

k=l ^ ^ m=l ^ ^ k=\ ^ ^ m=l ^ ^ 

n—A / „\ n—3—i / o -x -i ■ o 

E/n-3\ /n-3-i\ /Ix*™ , , /7\'^-3 

( J E ( ,„ ) y =»("-3)<5.(> 

1=1 ^ ^ m=l ^ ^ 

and the last by the function b of Lemma 13. 5| 

o ^ /^n - 3\ Vn - 3 - A;\ fl\>''^fk\ / u m(n-3-fc-i) 
^7 = 2^(^ A: j ^ V ^ ■[2) -[2) 

k=l ^ 1=1 ^ ' m=l ^ ' 

= 6(n-3)<4.(-) . 
Collecting the estimates gives the lemma. □ 
Theorem 3.7. 

lim 22"-3 . p{A nB) = 3. 



Proof. Follows immediately from Theorem 13.61 □ 
Theorem 3.8. 

P(A nB)- P(A) ■ P(B) 1 
hm —7— — = — . 

n^oo P{A nB) 3 

Proof. Follows immediately from Theorems 13.41 and 13.71 as P{B) = P{A). □ 
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Remark 3.9. Note that 



P{AnB)-P{A)-P{B) _^ P{A)-P{B) _^ P{B) 



P{AnB) P{A)-P{B\A) P{B\A)' 

so that Theorem \3.8\ can be formulated as 

P{B) 2 . , , ^'(^1^) 3 

^™ p^p I A\ = i;^ or eqmvalently lim , , = -. 
n^oo F[B I A) 6 n~*oo P[B) 2 

Theorem 13.81 shows that the events A = {a ^ s} and B = {s ^ b} are positively 
correlated for sufficiently large n. From this follows that the complementary events 
C = {a — > s} and D = {s b} also are positively correlated for sufficiently large n. It 
is in fact true for all n > 5 as the next theorem shows. 

Theorem 3.10. The events A = {a ^ s} and B = {s ^ b} are negatively correlated 
for n = 3, independent for n = 4 and positively correlated for n > 5. 

Proof. From Lemmas 13.31 and 13.61 we get 

PiA n B)-P{A) ■ P{B) = P{A DB)- {PiA)f 



> 



:t • 3-2(- - (- • l + 3.2-(-' 



2/ V \2J \ \2 



1 \ 2n-4 / /1\n-3 /7\n-l /4Q\n-l 

>,i) .(6-4.(1) -1-6.4. Q -10.24.(1?^ 



2n-4 

2. 



(5-c(n)), 



where c(n) is a decreasing function of n, with c(8) < 5, so that the theorem holds for 
n > 8. The remaining cases, 3 < n < 7, are proved using the recursion formulas in 
Lemmas 14.11 and 14.21 in the next section. □ 

4. Exact recursions 

In this section we will derive recursions for P{A) and P{A D B). For n > 2, s € [n] 
and K Cl[n\ \ {s} let {K s} denote the event {a ^ s for every a G K^. 
With \K\ = k define 

f{n,k) :=Pn{K^s), 

where in particular /(n, 0) = 1. Also set /(1,0) = 1 for convenience. 
For n > 3 and s,b G [n], K C [n] \ {s, b}, s b and \K\ = k define: 

g{n, k) := Pn{K ^ s,s ^ b), 

where in particular g{n,0) = f{n, 1). Also let g{2,0) := /(2, 1) = 1/2. 

Lemma 4.1. For n > k + 1 > 2 we have 

n—k—l 



1=0 
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Proof. We first deduce the following recursion: 

Lc[n]\{KU{s}) 

This can be seen as follows. We think of the set L as the vertices that we can reach 
from vertices in K in one step. Clearly, we must have s ^ L. Every edge from [n]\(K[JL) 
to K must be oriented in that direction, which gives the first power of 1/2. To reach a 
vertex c € L in one step it must not be the case that all edges {a, c},a € K are directed 
away from c, which gives the second factor. The edges within K make no difference and 
we have considered all edges going between K and [n] \ K. We are left with a situation 
where we must make sure that there is no directed path from any vertex in L to s passing 
over vertices in [n] \ which gives the last term. The recursion in the lemma is easily 
deduced from this, by summing over the size, i = |L|, of L. □ 

Lemma 4.2. For n>A; + 2>3 we have 

g{n,k)= 5^ ^ ^) %n-]J, 9(.n-k,i). 

Proof. First note that {K ^ s} and {s ^ b} implies {K ^ b}. Reasoning as in the 
previous proof, we obtain 

Pn{K^S,S^b)= Yl 2Hn~k-\L\) ~ ^) ' ^ ^^^^ -^S,S^ b). 

Lc[n]\{KU{s}U{b}) 

□ 

Using the lemmas, we can recursively compute the desired probabilities P{A) = /(n, 1) 
and P{A f] B) = g{n, 1). Exact and numerical values of P{A) and P{A fi B) were com- 
puted and are given in Table [T] together with numerical values of the relative covariance 
{P{A nB)- P{A) ■ P{B))/P{A n for 3 < n < 13. 



5. Cycles and trees 

Let G = Cn, the cycle with n vertices, and let c, d and n — c — d denote the distances 
(number of edges) between a and s, s and b, and b and a, respectively. We assume the 
three vertices to be distinct, so that c,d,n — c — d > 1. Further, let C := {a ^ s} and 
D -.= {8^ b}. Then, 




A COUNTER-INTUITIVE CORRELATION IN A RANDOM TOURNAMENT 



9 



where the last term corresponds to the case when there is a directed path s — > a ^ 6 — > s. 
This gives 

P{Cr\D) - P{C) ■ P{D) 

^\c+d f\\^ /l\"-c 



2y +1^2'"''"'+'"' -(-||-|(-|+(- 

-2"(1 - 2"^ - 2"'^) - (2'=+"' - 2^= - 2"^ + 1) 

/ \ 2n 

< _ / _ j . (^n^c+d . (|2C-<i _ 1)2 + (2^ - 1) • (2^ - 1 
-[^\ 2n 



< 

with equahty if and only \i c = d = 1. 
We have proved the following theorem. 

Theorem 5.1. When G is the cyclic graph with n nodes, Cn, the covariance between the 
events {a s} and {s b} is at most — with equality if and only if the vertices 

a and b are adjacent to s. 



For trees the situation is even simpler, as there are no cycles so that there is a unique 
path between any two vertices. Two cases can occur. If the path between a and b passes 



n 


P{A) ■ 2(2) 


P{A) 


p{Ar\B) ■ 2(2) 


P{A n B) 


P{AnB)-P(A)P(B) 
P(Ar\B) 


2 


1 


0.500 








3 


3 


0.3750 


1 


0.125 0000 


-0.125000 


4 


16 


0.250 


4 


0.062 500 


0.000000 


5 


150 


0.1465 


26 


0.025 390 6 


0.154898 


6 


2 504 


0.0764 


272 


0.008300 8 


0.296523 


7 


77472 


0.036 9 


4672 


0.002 2278 


0.387428 


8 


4677904 


0.0174 


139 696 


0.000520 4 


0.416449 


9 


571 023120 


0.008 3 


7 928 624 


0.0001154 


0.401547 


10 


142 058 571 776 


0.0040 


917140 928 


0.0000261 


0.374613 


11 


71626948 215168 


0.002 


220 836 999 808 


0.0000061 


0.355191 


12 


72 752 562 631695 616 


0.0010 


109 473 061398 784 


0.0000015 


0.344 746 


13 


148 346 259 329 909191680 


0.000 5 


110 228 037 783 934976 


0.000000 4 


0.339426 



Table 1. Probabilities and relative covariances 
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s, the paths between a and s and between s and b have no edges in common, so that the 
events {a — > s} and {s — > b} are independent. On the other hand, if the path between a 
and b does not pass s, then the events are disjoint, so that P{{a s} H {s —>■ b}) = 0, 
and the covariance is strictly negative. For a forest, if not aU three vertices are in the 
same tree, then trivially P{{a — >■ s} fl {s — > b}) = and at least one of P{a — > s) and 
P{s — > b) is zero, so that the events are independent. 

Theorem 5.2. When G is a tree (or a forest), the events {a s} and {s — > b} are 

either independent or mutually exclusive. 

6. Open problems and conjectures 

From Theorem 15 . 1 1 and the observations in Section^ we make the following conjecture. 

Conjecture 6.1. For any connected graph G = {V,E) and three distinct vertices a, s 
and b in V ; if s has degree at most two, then the events {a s} and {s — > 6} are 
independent or negatively correlated. 

Any connected simple graph G = (V,E),\V\ > 3 belongs to (at least) one of the 
following classes. 

I For any three distinct vertices a,b,s £ V{G), the events {a s} and {s — > 6} 

are non-positively correlated. 
II There exist three distinct vertices a,b,s € V{G), such that the events {a s} 
and {s — > 6} are negatively correlated and there exist three distinct vertices 
a',b',s' € V{G), such that the events {a' — > s'} and {s' — > b'} are positively 
correlated. Or there exist three distinct vertices a,b,s € V{G), such that the 
events {a s} and {s — > 5} are independent. 
Ill For any three distinct vertices a,b,s € V{G), the events {a s} and {s b} 
are non-negatively correlated. 
We have shown that trees and cycles belong to Class I, Kn,n > 5 belongs to Class 
III and ^4 minus one edge belongs to Class II. Note that when we have independent 
events there may be some overlap between the classes, in particular K4 belongs to all 
three classes. 

Conjecture 6.2. For large n most graphs will belong to Class II. 

In fact we guess that for n large enough, the graphs in Class I are joins (in some 
vague sense) of cycles and trees. It would be interesting if it was possible to characterize 
the graphs in Class I. We formulate the following more specific questions. Recall that 
outerplanar graphs are the graphs that do not have K/^ or as minors. 

Problem 6.3. Are all graphs in Class I (with \V{G)\ >5) outerplanar? 

Similarly one could ask for a characterization of the graphs in Class III. The following 
subproblem would also be interesting if it could be solved. 

Problem 6.4. For a given n, what is the smallest number k such that there exist k edges 
whose removal from Kn gives a graph not in Class III? 

Finally we ask if Class I and Class III are monotone. 
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Problem 6.5. Is it true that if G belongs to Class I, but not to Class II, then so does 
any connected subgraph obtained by removing one edge? Similarly, is it true that if G 
belongs to Class III, but not to Class II, then so does G plus any new edge? 

The results in [1] seem to suggest that Class III is larger than Class I. Is this true? 
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